Mining Frequent and Similar Patterns with Attribute Oriented Induction High Level Emerging Pattern (AOI-HEP) Data Mining Technique
نویسنده
چکیده
Attribute Oriented Induction High level Emerging Pattern (AOI-HEP) is a novel idea which is influenced by Attribute Oriented Induction (AOI) and Emerging Pattern (EP). AOI-HEP discovers patterns such as Total Subsumption HEP (TSHEP), Subsumption Overlapping HEP (SOHEP) and Total Overlapping HEP (TOHEP), include frequent and similar patterns. Mining TSHEP, SOHEP, TOHEP, frequent and similar patterns for each dataset is influenced by learning on high level concept in one of chosen attribute. The experiments used four datasets from UCI machine learning repository and most datasets have SOHEP but not TSHEP and TOHEP and the most rarely found were TOHEP. There are total twenty two High level Emerging Pattern (HEP) where four HEP are TSHEP, sixteen HEP are SOHEP and two HEP are TOHEP, and there are five frequent and four similar patterns from the experiments. Moreover, the experiment showed that adult and breast cancer datasets are interested to mine frequent pattern while breast cancer and IPUMS datasets are interested to mine similar pattern. However, census dataset is not interested to be mined for both frequent and similar patterns. AOI-HEP is suitable for dealing with large dataset since can handle million tuples in dataset in one digit seconds.
منابع مشابه
A Novel Technique for Pattern Extraction in Mixed Data
Knowledge discovery in databases or data mining is an important issue in the development of data and knowledge base system. The Self Organizing Map (SOM) is a vector quantization method which places the prototype vectors on a regular lowdimensional grid in an ordered fashion. Clustering data and extracting patterns from the clusters are very important tasks in data mining. An attribute-oriented...
متن کاملHigh Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملA New Algorithm for High Average-utility Itemset Mining
High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...
متن کاملExploration of the Power of Attribute-oriented Induction in Data Mining
Attribute-oriented induction is a set-oriented database mining method which generalizes the task-relevant subset of data attribute-by-attribute, compresses it into a generalized relation, and extracts from it the general features of data. In this chapter, the power of attribute-oriented induction is explored for the extraction from relational databases of diierent kinds of patterns, including c...
متن کاملFAT-CAT: Frequent Attributes Tree Based Classification
The natural representation of XML data is to use the underlying tree structure of the data. When analyzing these trees we are ensured that no structural information is lost. These tree structures can be efficiently analyzed due to the existence of frequent pattern mining algorithms that works directly on tree structured data. In this work we describe a classification method for XML data based o...
متن کامل